Goto

Collaborating Authors

 Dover


Going Whole Hog: A Philosophical Defense of AI Cognition

Cappelen, Herman, Dever, Josh

arXiv.org Artificial Intelligence

This work defends the 'Whole Hog Thesis': sophisticated Large Language Models (LLMs) like ChatGPT are full-blown linguistic and cognitive agents, possessing understanding, beliefs, desires, knowledge, and intentions. We argue against prevailing methodologies in AI philosophy, rejecting starting points based on low-level computational details ('Just an X' fallacy) or pre-existing theories of mind. Instead, we advocate starting with simple, high-level observations of LLM behavior (e.g., answering questions, making suggestions) -- defending this data against charges of metaphor, loose talk, or pretense. From these observations, we employ 'Holistic Network Assumptions' -- plausible connections between mental capacities (e.g., answering implies knowledge, knowledge implies belief, action implies intention) -- to argue for the full suite of cognitive states. We systematically rebut objections based on LLM failures (hallucinations, planning/reasoning errors), arguing these don't preclude agency, often mirroring human fallibility. We address numerous 'Games of Lacks', arguing that LLMs do not lack purported necessary conditions for cognition (e.g., semantic grounding, embodiment, justification, intrinsic intentionality) or that these conditions are not truly necessary, often relying on anti-discriminatory arguments comparing LLMs to diverse human capacities. Our approach is evidential, not functionalist, and deliberately excludes consciousness. We conclude by speculating on the possibility of LLMs possessing 'alien' contents beyond human conceptual schemes.


Hyperbolic Image-Text Representations

Desai, Karan, Nickel, Maximilian, Rajpurohit, Tanmay, Johnson, Justin, Vedantam, Ramakrishna

arXiv.org Artificial Intelligence

Visual and linguistic concepts naturally organize themselves in a hierarchy, where a textual concept "dog" entails all images that contain dogs. Despite being intuitive, current large-scale vision and language models such as CLIP do not explicitly capture such hierarchy. We propose MERU, a contrastive model that yields hyperbolic representations of images and text. Hyperbolic spaces have suitable geometric properties to embed tree-like data, so MERU can better capture the underlying hierarchy in image-text datasets. Our results show that MERU learns a highly interpretable and structured representation space while being competitive with CLIP's performance on standard multi-modal tasks like image classification and image-text retrieval.


Launching Apple, Gmail, And A Harvard-IBM Robot Super-Brain

Forbes - Tech

This week's milestones in the history of technology include the birth of Apple Computer, the first release of Gmail, and IBM signing an agreement with Harvard to build one of the earliest computers, the Automatic Sequence Controlled Calculator (ASCC), later called Mark I. Guglielmo Marconi receives the first wireless signal transmitted across the English Channel, sent from Wimereux, France, to his ship-to-shore station at the South Foreland Lighthouse outside Dover, England. The signal was a test held at the request of the French Government which was considering licensing the invention in France. Bell Telephone Laboratories announces the invention of the phototransistor, a transistor operated by light rather than electric current, invented by John Northrup Shive. An entirely new type of "electric eye" much smaller and sturdier than present photo-electric cells and possibly cheaper-has been invented at the Laboratories. During the past quarter century, electric eyes have found widespread use in electronics because of their ability to control electric currents by the action of light.


Development of a Cargo Screening Process Simulator: A First Approach

Siebers, Peer-Olaf, Sherman, Galina, Aickelin, Uwe

arXiv.org Artificial Intelligence

Some manufacturers provide benchmarks for individual sensors but we found no benchmarks that take a holistic view of the overall screening procedures and no benchmarks that take operator variability into account. Just adding up resources and manpower used is not an effective way for assessing systems where human decision-making and operator compliance to rules play a vital role. Our aim is to develop a decision support tool (cargo-screening system simulator) that will map the right technology and manpower to the right commodity-threat combination in order to maximise detection rates. In this paper we present our ideas for developing such a system and highlight the research challenges we have identified. Then we introduce our first case study and report on the progress we have made so far. Keywords: port security, cargo screening, modelling and simulation, decision support, detection rate matrix 1. INTRODUCTION The primary goal of cargo screening at sea ports and air ports is to detect human stowaways, conventional, nuclear, chemical and radiological weapons and other potential threats. This is an extremely difficult task due to the sheer volume of cargo being moved through ports between countries. For example in sea freight, 200 million containers are moved through 220 ports around the globe every year; this is 90% of all non bulk sea cargo (Dorndorf, Herbers, Panascia, and Zimmermann 2007). Little is known about the efficiency of current cargo screening processes as few benchmarks exist against which they could be measured (e.g.